|
Segmented regression, also known as piecewise regression or 'broken-stick regression', is a method in regression analysis in which the independent variable is partitioned into intervals and a separate line segment is fit to each interval. Segmented regression analysis can also be performed on multivariate data by partitioning the various independent variables. Segmented regression is useful when the independent variables, clustered into different groups, exhibit different relationships between the variables in these regions. The boundaries between the segments are ''breakpoints''. Segmented linear regression is segmented regression whereby the relations in the intervals are obtained by linear regression. ==Segmented linear regression, two segments== Segmented linear regression with two segments separated by a ''breakpoint'' can be useful to quantify an abrupt change of the response function (Yr) of a varying influential factor (x). The breakpoint can be interpreted as a ''critical'', ''safe'', or ''threshold'' value beyond or below which (un)desired effects occur. The breakpoint can be important in decision making 〔''Frequency and Regression Analysis''. Chapter 6 in: H.P.Ritzema (ed., 1994), ''Drainage Principles and Applications'', Publ. 16, pp. 175-224, International Institute for Land Reclamation and Improvement (ILRI), Wageningen, The Netherlands. ISBN 90-70754-33-9 . Free download from the webpage () , under nr. 13, or directly as PDF : ()〕 The figures illustrate some of the results and regression types obtainable. A segmented regression analysis is based on the presence of a set of ( y, x ) data, in which y is the dependent variable and x the independent variable. The least squares method applied separately to each segment, by which the two regression lines are made to fit the data set as closely as possible while minimizing the ''sum of squares of the differences'' (SSD) between observed (y) and calculated (Yr) values of the dependent variable, results in the following two equations: * Yr = A1.x + K1 for x < BP (breakpoint) * Yr = A2.x + K2 for x > BP (breakpoint) where: :Yr is the expected (predicted) value of y for a certain value of x; :A1 and A2 are regression coefficients (indicating the slope of the line segments); :K1 and K2 are ''regression constants'' (indicating the intercept at the y-axis). The data may show many types or trends,〔'' Drainage research in farmers' fields: analysis of data''. Part of project “Liquid Gold” of the International Institute for Land Reclamation and Improvement (ILRI), Wageningen, The Netherlands. Download as PDF : ()〕 see the figures. The method also yields two correlation coefficients (R): * for x < BP (breakpoint) and * for x > BP (breakpoint) where: : is the minimized SSD per segment and :Ya1 and Ya2 are the average values of y in the respective segments. In the determination of the most suitable trend, statistical tests must be performed to ensure that this trend is reliable (significant). When no significant breakpoint can be detected, one must fall back on a regression without breakpoint. 抄文引用元・出典: フリー百科事典『 ウィキペディア(Wikipedia)』 ■ウィキペディアで「Segmented regression」の詳細全文を読む スポンサード リンク
|